Engineering posts about Data Lakehouse
Curated summaries and key learnings for engineers working with Data Lakehouse.
The Convergence of Open Table Formats and Open Catalogs: Catalog Commits is Generally Available
The article announces the General Availability of Catalog Commits, a significant enhancement for Delta Lake and Unity Catalog that aims to unify the lakehouse architecture by addressing coordination...
AI Data Transformation Guide for Data Engineers and Data Scientists
The article outlines the critical process of AI data transformation, emphasizing its importance for data engineers and data scientists in converting raw data into structured formats suitable for...
What Are Analytic Applications?
Analytic applications are specialized business intelligence (BI) tools designed to facilitate data-driven decision-making within specific business domains. They integrate data from various...
10 Data Warehouse Migration Myths Blocking AI-readiness (and Your Blueprint for Seamless Modernization)
The article outlines ten prevalent myths surrounding data warehouse migrations that hinder organizations from achieving AI readiness. It emphasizes the importance of viewing migration as a strategic...
Azure Databricks Lakebase is Generally Available
Azure Databricks Lakebase is a managed, serverless PostgreSQL service designed to enhance data architecture by integrating operational capabilities directly into the lakehouse environment on Azure....
Nasdaq eVestment Data Now on Databricks Marketplace
The article announces the availability of Nasdaq eVestment data through Delta Sharing on Databricks Marketplace, giving asset managers access to live, query-ready institutional investor data. This...
The Marketing Cloud and Adstra deliver identity resolution through Databricks Clean Rooms for secure, privacy-first marketing data collaboration
The Marketing Cloud and Adstra have partnered with Databricks to enhance identity resolution and audience enrichment using Databricks Clean Rooms. This collaboration allows brands to...
Structured vs unstructured data
The article explores the fundamental differences between structured and unstructured data, highlighting the advantages and challenges associated with each type. Structured data is organized within...
Arctic Wolf’s Liquid Clustering Architecture Tuned for Petabyte Scale
Arctic Wolf has implemented a liquid clustering architecture to optimize the processing of over one trillion security events daily, resulting in enhanced query performance and data freshness. By...
BCBS 239 Compliance in the Age of AI: Turning Regulatory Burden into Strategic Advantage
The article explores how financial institutions can leverage Databricks to automate compliance with BCBS 239, a regulatory standard for risk data aggregation and reporting. It highlights the...
Top 10 Questions You Asked About Databricks Clean Rooms, Answered
The article discusses Databricks Clean Rooms, a secure environment for collaborative analysis of sensitive data without exposing raw records. It outlines how organizations can use Clean Rooms to...